Draft version May 8, 2009 

Preprint typeset using 1^1^^ style cmulateapj v. 10/09/06 



SPITZER OBSERVATIONS OF A GRAVITATIONALLY LENSED QUASAR, QSO 2237+0305 

Eric Agol"'^, Stephanie M Gogarten^, Varoujan Gorjian^, Amy Kimball""^ 

^ Department of Astronomy, University of Washington, Box 351580, Seattle, WA 98195, agol@astro.washington.edu and 
^ Jet Propulsion Laboratory, California Institute of Technology, 4800 Oak Grove Drive, Pasadena, CA 91109 

Draft version May 8, 2009 

- - , ABSTRACT 

' The four-image gravitationally lensed quasar QSO 2237+0305 is microlensed by stars in the lens 

, galaxy. The amplitude of microlensing variability can be used to infer the relative size of the quasar as 

■ a function of wavelength; this provides a test of quasar models. Towards this end, we present Spitzer 

Space Telescope Infrared Spectrograph (IRS) and Infrared Array Camera (IRAC) observations of 
QSO 2237+0305, finding the following: (1) The infrared spectral energy distribution is similar to 
that of other bright radio-quiet quasars, contrary to an earlier claim. (2) A dusty torus model with 
a small opening angle fits the overall shape of the infrared spectral energy distribution well, but the 
quantitative agreement is poor due to an offset in wavelength of the silicate feature. (3) The flux 
ratios of the four lensed images can be derived from the IRAC data despite being unresolved. We find 
that the near-infrared fluxes are increasingly affected by microlensing towards shorter wavelengths. 
(4) The wavelength dependence of the IRAC flux ratios is consistent with the standard quasar model 
in which an accretion disk and a dusty torus both contribute near 1 micron in the rest frame. This is 
also consistent with recent infrared spectropolarimetry of nearby quasars. 
• Subject headings: 

o 

H ' 1- INTRODUCTION 



oo 



o 



^ ■ Radio quiet quasars, or quasi-stellar objects (QSOs), are some of the most luminous objects in the Universe, reaching 
-[^q13-142^^ in the brightest cases; they are also very compact, hence the name "quasi-stellar." Such a large luminosity 
from a compact source cannot be powered by stars, but can be powered by a super-massive black hole at the center of a 
^-H ' galaxy (Lynden-BeU 1969). The black hole creates radiation by accreting gas via an accretion disk near the Eddington 
^ limit. The accretion disk is fed by gas from the surrounding galaxy via a dust and gas torus on parsec scales. This 
widely held picture explains the two most significant features of quasar spectral energy distributions: 1) a broad peak 
' in the optical/ultraviolet due to the accretion disk and 2) a broad peak in th e infrared due to the dusty torus. These 
I two spectral compon ents are commonly referred to as the "Big Blue Bump" (|Shield£lll978f) and the "infrared bump" 
" I , (jSanders et al.llTQSQf ). with comparable luminosities in each. In between these two peaks lies a valley dubbed "the 
one-micron dip." The standard model naturally accounts for the one-micron dip due to the sublimation temperature 
, of dust; the dusty torus is heated by radiation from the accretion disk, but dust cannot exist at temperatures above 
0^ . about 1500 K, causing a cutoff in the emission from the torus that always occurs near 1 micron. This paper presents 
' a novel test of this two-component model using measurements of gravitational microlensing near the one micron dip of 
the high redshift quasar QSO 2237+0305 {zg = 1.695). Near one micron both the accretion disk and dusty torus have 
nearly equal specific luminosity, but very different sizes, so the region near one micron is ideal for testing the standard 
model with microlensing. 



X 



1.1. Background 

QSO 2237+0305 was chosen for this study as it holds several records among gravitationally lensed quasars: it was 
one of the first four -image lenses discovered (jHuchra et al.1119851 ): its lens gala xy has the lowest redshift, zi — 0.0395 
(|Huchra et al.l [19851 ): and it was the first to show gravitational microlensing (jlrwin et al.lll989f) . This last fact is a 
result of the second: a nearby lens galaxy causes a large velocity of the quasar relative to the magnification patterns 
created by stars in the lens galaxy projected onto the source plane; this large relative velocity results in a shorter 
timescale for microlensing. The discovery of microlensing in this system and its short microlensing time scale made it 
a "rosetta stone" for studies of the size of the quasar emission region: the time-dependent microlensing magnification is 
sensitive to the size of the source, effectively resolving the quasar on sub-microarcsecond scales. Larger sources smooth 
over the microlensing mag nification pattern and thus experience smaller and more gradual variations in magnification 
jRefsdal and Stabcll 1991). In unlcnscd quasars, only the spectral energy distribution can be compared to models (e.g. 
[Sanders et al. 1989; Blacs et al. 2001; Malkan 1983), while for QSO 2237+0305 the size as a function of wavelength 
can be compared to models as well, in principle giving much stronger constraints on the emission mechanism. 

Despite this promise, the interpretation of the first optical microlensing events in QSO 2237+0305 were puzzlin g: one 
study showed the inferred size of the Big Blue Bump was consistent with the accret ion disk model j Wambsganss et al.l 
ll99Clf ). while another study showed the size of the emission region was too small (Ranch and Blandford 1991) . The 
latter result led to other newer models which require more theoretical development, e.g. Barvainis (1993): Czern v et al.l 
(|1994[ ). With a much larger data set and more sophisticated analysis of the microlensing lightcurves. iKochanelj (p004f ) 
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TABLE 1 

Summary of Observations. 



Date 



Instrument 



Module 



Integration Time / Exp. Num. of Exp. 



2005-11-17 
2005-11-17 
2005-11-17 
2005-11-17 
2005-11-20 

2005- 11-20 

2006- 06-29 



IRAC 
IRAC 
IRAC 
IRAC 
IRS 
IRS 
IRS 



Long-Low 
Long-High 



Short- Low 



3.6 fim 
4.5 fim 
5.8 fim 
8.0 fim 



1.2 sec 32 

1.2 sec 32 

1.2 sec 32 

1.2 sec 32 

6.29 sec 64 

14.68 sec 128 

60.95 sec 60 



showed that thermal emission from an accretion disk is consistent with the size inferred from microlensing. However, 
microlensing in a sample of gravitationally lensed quasars has led to a different conclusion: the size of the optical/UV 
emission region inferred fro m microlensing is too large compar ed with the size of quasar accretion disk models inferred 
from fitting the SEDs (Poolev et al.ll2007t lMo"rgan et al.ll2007f) . 

This confused state of affairs of microlensing of the Big Blue Bump partly stems from the fact that the absolute 
size is difficult to constrain as there are degeneracies between the mass of the microlenses and the sky velocity 
of the quasar relative to the magnification pattern. Howev er, the relative size versus wavele ngth is much easier 
to constrain since it is not as subject to these degeneracies (jWambsganss and Paczvnskil 119911 ). In particular, the 
first results for the wavelengt h-dependent relative siz e seem to be in good agreement with the accretion disk model 
for a different lensed quasar (jPoindexter et al.ll2008l ). although the absolute size is still discrepant. The relative 
size of the optical/ultraviolet /X-ray emission region for QSO 2237-1-0305 is well constrained by microlensing (e.g. 
IWvithe et al.ll2000t lKochanekll2004 lAnguita et al.|[2008f ). t aking advantage of t he long time scale da ta set collected 
by the Optical Gravitational Lensing Experiment (OGLE 'Udals ki et al.l l200^ iWozniak et aT]|2000D . An intensive 
monitoring campaign with the Very Large Telescope (VLT) promises to ^ve a ve ry detailed picture of the relative sizes 
of the Big Blue Bump and broad- line regions as a function of wavelength ( Eigcnbr qd. Courbin. Sluse. Mevlan and Agoll 
l2008HEigenbrod. Courbin. Meylan. Agol. Anguita. Schmidt and Wambsganssi200"8l ). In this paper we will not attempt 
to resolve the absolute size problem, but rather we will argue that the standard two-component model provides good 
agreement with the wavelength dependence of the microlensing flux ratios, adding credence to the standard model. 

The first microlensing study of the infrared bum p was carried out with QSO 2237-1-0305 to distinguish synchrotron 
and dust emission models for the infrared bump ()Agol et al.l l2000l ). The synchrotron emission model provides an 
alternative, albeit less natural, explanation for the infrared bump. The synchrotron emission region responsible for the 
infrared bump has to be compact to avoid self-absorption; thus it should show strong variability due to microlensing. 
On the other hand the dusty t orus model must be extended to avoid sublimation, and thus should vary weakly 
due to microlensing. lAgol et al.l (2000) found that the mid-infrared flux ratios were consistent with no microlensing 
([Schmidt et al. I [19981 ): this despite the fact that the optical source was simultaneously undergoing strong microlensing 
events. These observations ruled out strong microlensing magnification of the mid-infrared emission region, which was 
one of the first clear-cut demonstrations th at the infrared emis sion region in radio-quiet quasars is due to thermal 
emission by dust, not synchrotron emission (jWvithe et al.l[200l . Here we extend these results to observations near 
the 1 micron dip where both the dusty torus and accretion disk contribute to the flux. 



In §2 we discuss the observations and data reductions. Although the primary focus of this paper is on probing the 
relative source size versus wavelength, there are two problems related to the SED that may be addressed with our 
data as well: 1) How similar is the SED of QSO 2237-1-0305 to other quasars and Seyfert galaxies? Groun d-based 
observations indicated that QSO 2237-1-0305 contained hotter dust than any other quasar ([Agol et al.ll2000[ ). while 
the observations presented here show that the ground-based observations at one wavelength were in error. The large 
intrinsic luminosity and high magnification, fi ~ 16, (Schmidt et al. 1998) make this QSO an excellent candidate for 
spectroscopy and allow us to compare the spectrum of a high-redshift quasar with nearby Seyfert galaxies. In §3.1 we 
show that QSO 2237-f 0305 looks very similar to other quasars and Seyfert galaxies. 2) How well does the infrared 
SED match dusty torus models? In §3.2 we show that the overall shape agrees qualitatively, but the quantitative 
agreement is poor. 

In §3.3-3.4 we present the microlensing results and interpretation for QSO 2237+0305, demonstrating that two size 
scales are required to fit the microlensing flux ratios, as predicted by the accretion disk/dusty torus model. In §4 we 
discuss the implications for quasars in general and in §5 we summarize. 



Cycle 2 observations with the Spitzer Space Telescope were awarded for studying QSO 2237+0305 under program 
20475 QSO 2237+0305 {a = 22'*40"'30.2^ 5 = 3°21'31'.'l, J2000 ) was observ ed with both the Infrared Spectrograph 
(IRS: [Houck et al]l2004l ) and the Infrared Array Camera (IRAC: fPazio et all 12004) on Spitzer. A summary of obser- 
vations is presented in Table [TJ Listed integration times are for observations of the QSO only; peak-up observations 
and sky observations are not included. 



1.2. Plan of the paper 



2. OBSERVATIONS 



2.1. IRS 
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Basic Calibrated Data (BCD) images were obtained from the Spitzer archive, pipehne version S14.0.0. QSO 
2237+0305 was observed with the IRS modules Short-Low (SL2, 5.2-8.7 /xm, and SLl, 7.4-14.5 /xm), Long-Low (LL2, 
14.0-21.3 /im), and Long-High (LH, 18.7-37.2 /im), for a full observed wavelength coverage of 5.2-37.2 /im. Rogue 
pixels were eliminated using the IRSCLEAN_MASK software package provided by the Spitzer Science Center. We 
created our own rogue pixel maps (pixels with anomalous behavior) for each spectral order by measuring two quantities 
from a series of sky images for each order (for the long-high data we chose only portions of the image that did not 
contain the target): (1) the scatter in each pixel with time; (2) the difference between the value of a pixel and the 
median of a region within a 5x5 pixel region surrounding it. We then flagged pixels which had either excessive scatter 
or consistently had values much larger than the median smoothed image, and included these in the rogue pixel map. 
This procedure resulted in similar maps to those generated automatically by the IRSCLEAN_MASK software, but 
was better at flagging more rogue pixels so that we did not have to flag any pixels by hand. Rejected pixels were 
interpolated from the surrounding pixels. 

The cle aned spectra were c oadded and extracted using the Spectroscopy Modeling Analysis and Reduction Tool 
fSMART: iHigdon et al.|[2004f ). The method of sky subtraction depended on the resolution of the data. For the low- 
resolution data, sky subtraction was performed by subtracting one nod position from the other before extraction. For 
high-resolution data, the narrow width of the slit required that separate sky images be subtracted from the QSO 
images in each nod position. A separate set of images 130 arcseconds away from the QSO was taken for this purpose. 
This same procedure applied to a standard star (HR 7341) yielded a spectrum which matched between each of the 
IRS orders/modules, matched the calibration spectrum within 5%, and gave a spectrum which obeyed the Rayleigh- 
Jeans limit, so we are confident of the relative calibration of our data, but expect that the absolute calibration has an 
uncertainty of 5%. We re- reduced the data with later versions of the pipeline which resulted in fluxes that differ by as 
much as 20% in the overlapping region between different orders, while the 14.0.0 pipeline did not have this problem. 

For each order of each module, we fit a Gaussian to the distribution of the difference in flux between the two nods 
divided by the sum of the squares of the uncertainties; in all cases the standard deviation of this distribution differed 
from unity indicating that the uncertainties were misestimated. We scaled the SMART uncertainties by the standard 
deviation of this Gaussian. We then fit the median-smoothed spectrum from all nods and orders with a 5-th order 
polynomial, and cleaned the data of points which disagreed by > 3ct from this fit, as well as points for which the 
two nods disagreed by > 3cr. This procedure automatically removed data near the edges of each order, which are 
notoriously unreliable, and also removed other outliers which may be due to improperly cleaned cosmic rays or rogue 
pixels. 

2.2. IRAC 

QSO 2237-1-0305 was observed for 38.4 sec in each of the four IRAC wavebands (3.6, 4.5, 5.8, and 8.0 /im) in full 
array mode with a 16-position spiral dither pattern with 2-second exposures at each position. Post-BCD mosaics were 
obtained from the Spitzer archive (pipeline version 13.0.2), which we used in our analysis. As QSO 2237-1-0305 is only 
a few pixels across at the resolution of IRAC (see Figure [1]), the four lensed images and lens galaxy are unresolved; 
however, we were still able to derive the flux ratios of the four images. 

2.3. Flux ratios 

For comparison with microlensing models, we derived the fluxes of the lensed quasar images from the IRAC data. 
As the pixel size of the IRAC images, 1'.'2, is comparable to the separation of the quasar images, this required a 
multi-component model fit. Since the relative positions of the quasar images are known extremely accurately, and 
since the IRAC point spread function (PSF) is known fairly precisely, we derived the flux ratios of the four lensed 
images of the quasar with PSF fitting. The main uncertainty in the fitting is the contribution of the lens galaxy 
to the flux in the IRAC bands; we addressed this by using the HST H-band as a model from the CASTLES survey 
iKochanck et al. 2009), assuming no color gradients between the H-band and IRAC bands. This is likely a good 
approximation as extinction and intrinsic colors should vary weakly in the infrared since the stellar emission is well 
into the Rayleigh-Jeans tail. 

We created a model composed of (i) four quasar images; (ii) the one star 10" from the center of the lens galaxy; 
(iii) the lens galaxy flux, scaled from a deconvolved HST _ff-band image; (iv) a uniform sky background. We held the 
relative positions of the quasar images (and the nearby star) flxed to the values derived from the HST H-band image, 
while we allowed the absolute position to vary (given the uncertain absolute pointing of Spitzer). 

The lens galaxy was isolated in the HST H-band image by masking the quasar images and stars (within a circle 
14.25 pixels from the location of each point source), and the masked region for each quasar image was replaced with 
an elliptical Sersic model flt to the remaining H-band data, while the masked region near the star was filled in with the 
median flux near its location. As the Spitzer IRAC PSF is derived at 5 times the pixel resolution (0'.'24), we rotated 
and compressed the HST image to fit the Spitzer images at 5 times the resolution. We then convolved the HST image 
with the IRAC PSF for each band, multiplying by a constant factor to scale to each IRAC band, and added to this 
the five point fiuxes by interpolating the Spitzer IRAC PSF to the location of each point source and multiplying by 
their respective fluxes. Finally we added in a constant flux to represent the sky. 

These model components give a total of nine free parameters to fit: 5 point sources, the extended galaxy flux scaling 
factor, the sky flux, and the RA and DEC of image A, which was taken as the reference point. We computed the 
of this model by comparing with the Post-BCD mosaic and uncertainties from the Spitzer IRAC pipeline. We 
optimized the model parameters using the Levenberg-Marquardt method, and then found the uncertainties on each 
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Fig. 1. — Top left panel: Cutout of 32x32 pixel region centered on the Channel 2 (4.5 /im) mosaic, logarithmic intensity scaling. Top 
right panel: best-fit model to Channel 2 data, including four quasar images, galaxy scaled from HST H-band, and star. Lower left 
panel: difference between data and model. The HST H-band image limits the size of the region in the model to the central 245 pixels. 
Lower right panel: "Deconvolved" model image at 5 times the resolution of Spitzer with the four QSO images and nearby star labelled. 



TABLE 2 

Fluxes of quasar images in milli-Janskys and optical spectral slope. 



Image P<^iV) Oj, F^{3.6^lm) F^{4:.5^lm) F^{5.8fim) F^{8.0^J.m) 

A 0.507 ±0.005 -1.064 ±0.002 1.60 ± 0.07 1.99 ± 0.08 3.21 ± 0.37 4.49 ± 0.22 

B 0.257 ±0.004 -0.859 ± 0.004 1.14 ±0.06 1.73 ± 0.06 2.78 ±0.28 4.19 ±0.19 

C 0.197 ±0.004 -1.374 ±0.005 0.48 ± 0.05 1.25 ± 0.07 1.60 ±0.27 2.67 ±0.16 

D 0.185 ±0.005 -1.335 ±0.006 0.85 ± 0.06 1.28 ± 0.08 2.27 ±0.33 3.76 ± 0.22 

Total A-D 1.15 ±0.01 4.08 ±0.11 6.25 ±0.15 9.86 ± 0.60 15.11 ±0.40 

Star 0.29 ±0.01 0.18 ±0.01 0.13 ±0.05 0.00 ± 0.03 

Galaxy 5.34 ± 0.08 3.52 ± 0.09 2.87 ± 0.35 1.96 ± 0.23 



parameter from a Markov chain computed using the method described in iTegmark et al] (|2004f ). The best-fit for 
the four IRAC channels was (314,209,91,162) for 236 degrees of freedom (9 model parameters to fit the flux of 245 
pixels which is the region covered by the HST H-band image). Formally these fits range from very good to poor, which 
may indicate that the model is inadequate (e. g. possibly the galaxy has color gradients between 2.2 and 3.6 microns), 
or the error bars are misestimated. We also computed error bars on the model parameters using the covariance matrix 
evaluated at the best fit and by finding the region with Ax^ < 1 for each parameter while marginalizing over the 
other parameters; each of these techniques gave error bars nearly identical to the Markov chain. We converted these 
values to fluxes in mJy, as well as flux ratios, and report the derived fluxes and errors in Table [2] For the galaxy we 
report the entire model flux within 5 pixels (6") of the center of the galaxy except for the contribution from quasar 
images. The y-band data is from data taken by the OGL E collaboration one day before the Spitzer observations, and 
the errors reported are relative flux errors, not absolute (jUdalski et al.|[200^ . In addition we report the continuum 
spectral slope, ai, measured at 5400 A for f„ (x for all four images measured with a VLT observation on 11 
November 2005 (jEigenbrod. Courbin. Sluse. Mevlan and Agolll2008| ). 



3. RESULTS 



5 




TABLE 3 

Fluxes ratios of quasar images (IRAC) 



Image V band 3.6 fim 4.5 fim 5.8 8.0 /^m 

B/A 0.507 ±0.010 0.711 ± 0.061 0.869 ± 0.053 0.867 ±0.195 0.932 ± 0.073 

C/A 0.389 ±0.009 0.298 ± 0.040 0.626 ± 0.041 0.499 ± 0.099 0.593 ± 0.043 

D/A 0.366 ±0.010 0.531 ± 0.049 0.643 ± 0.054 0.709 ±0.177 0.837 ± 0.075 

C/B 0.767 ±0.020 0.418 ± 0.051 0.720 ±0.051 0.576 ±0.115 0.637 ± 0.043 

D/B 0.721 ±0.023 0.746 ± 0.080 0.740 ±0.054 0.817 ±0.133 0.898 ± 0.069 

D/C 0.940 ±0.032 1.783 ± 0.319 1.027 ±0.104 1.420 ±0.431 1.411 ±0.146 

A/(A+B+C+D) 0.442 ±0.018 0.394 ± 0.017 0.319 ± 0.011 0.325 ± 0.036 0.297 ± 0.013 

B/(A+B+C+D) 0.224 ±0.009 0.280 ± 0.014 0.277 ±0.010 0.282 ±0.029 0.277 ±0.011 

C/{A+B+C+D) 0.172 ±0.008 0.117 ±0.014 0.199 ±0.010 0.162 ±0.026 0.176 ±0.010 

D/(A+B+C+D) 0.162 ±0.008 0.209 ± 0.016 0.205 ±0.013 0.230 ± 0.034 0.249 ± 0.015 



3.1. Spectral Energy Distribution 

In this section we compare the Spitzer spectral energy distribution of QSO 2237+0305 to Seyfert galaxies and 
quasars to show that it looks like a typical radio-quiet active galaxy in the infrared. Figure [5] shows the Spitzer 
spectrum of QSO 2237+0305 which has been binned so that each bin has a signal-to-noise greater than 100. The 
excellent agreement between the IRS and IRAC results, which had completely independent flux calibration, bolsters 
our confidence in the accuracy of our reported fluxes. 

We fit a power law of the form Fi, cx to the IRS spectrum, and we find a — — 0.96± .02, giving a spectrum which 
is nearly flat in vFi,. 

Figure [3] shows the full spectrum of QSO 2237+0305; the isotropic luminosity is defined as [vL^)rest — 
AT:D\{vF^)ohg/ ^, where rest/obs refer to the rest-frame/observed frequencies, D/, is the luminosity distance of the 
quasar, and /j, is the total magnification of the quasar. Included are our dat a from both IRS and IRA C, optical and 
near- infrar ed data points from the CfA- Arizona Space T elescope LEns Survey (iFalco et al.ll2 0011. OGLE (jUdalski et al.l 
l2006f ). and lEigenbrod. Courbin. Sluse. Mevlan and Agol (,,2008. ) . X-ray data from lDai et al.,, C2003.)i two data point s 
from the GALEX archive (jMartin et al.l[2005[ ). and a sub-mm data point at 850 ^m from lBarvainis and IvisonI (|2002| ). 
To compute the tota l luminosity we have assumed the cosmological param eters from the WMAP 5-year data set 
(jPunklev et al.|[2008f ) as well as a total macrolensing magnification of ^ = 16 (jSchmidt et al.l ll9981. The optical data 
have been corrected for extinction in the Milky Way assuming a Galactic reddening of E{B — V) = 0.068 with Ry = 3.1 
extinction curve. As the light from the quasar passes through different portions of the bulge of the lens galaxy, we need 
to make additional extinction corrections for the four lensed images. We have used the flux ratios of the broad lines 
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rest X (m"^) 

Fig. 3. — Full isotropic spectral energy distribution of QSO 2237+0305. Black line is X-ray data from Chandra, blue points are from 
GALEX, black spectrum near the peak (big-blue bump) is from the VLT, red points are from the Castles database and from OGLE, black 
infrared points are Spitzer data from t his paper, and the black point in the lower right is from ground-based sub-mm observations; green 
line is quasar composite spectrum from lElvis et al] II 19941 ) normalized to the l/im dip. The dotted box shows the region plotted in Figure 

HI 

averaged over time from the data set of lEigenbrod. Courbin. Sluse. Mevlan and Agoll (|2008f ) to derive the relative ex- 
tinction of the four quasar lensed images. We find image B is extincted relative to image A by AE{B — V) = 0.02±0.05, 
while images C and D are reddened with respect to images A and B by AE{B — V) = 0.10 ± 0.04, 0.18 ± 0.03, respec- 
tively. Since images A and B have small (possibly zero) relative extinction, we assume that each of these images has 
zero extinction in the lens galaxy, and simply correct images C and D. We have not attempted to correct the data for 
microlensing, so the overall uncertainty is at least 0.2 mag. From the QSO 2237-f 0305 SED we find a total luminosity 
of Ltot = 4.0 X 10« erg/s. 

In Figur e Bl we plot the SE D of QSO 2237-1-0305, and for reference compare it to the composite radio-quiet quasar 
SED from lElvis et al.l (|1994[ ). normalized to 1.3 /xm. Although QSO 2237-)-0305 appears underluminous in the X-ray, 
UV, mid-IR, and sub-mm relative to the composite, this behavior is well within the range of SEDs in the Elvis sample, 
and it is likely that the composite is affected by selection biases at these wavelengths where many quasars were not 
detected. The SED of QSO 2237-f 0305, shown in Figure [H looks fairly typi cal compared to a composite spectrum of 
Palomar-Green quasars with weak far- infrared emission (jNetzer et al.ll2007l ). There are minor differences such as an 
extra bump near 6-7 /Ltm and a peak associated with the hottest dust at slightly longer wavelengths (~ 2.8±0.3/im), but 
these differences are well within the range of variation within the PG quasar sample. If we "fit" the QSO 2237-1-0305 
SED by scaling the Netzer composite spectrum by an arbitrary factor, we find a, — 573 for 32 DOF, which is 
formally a very poor fit, but the discrepancy is dominated by the disagreement in the cutoff at short wavelengths and 
the bump near 7 microns. 

The infrared spectrum of QSO 2237-1-0305 also looks very similar to low redshift Seyfert galaxies taken from a sample 
of 23 galaxies (Gorjian et al., in preparation). The Spitzer spectrum of QSO 2237-1-0305 is plotted with the two most 
similar Seyfert spectra in Figure [5l scaled to match the flux of QSO 2237+0305. The qualitative shape of the SEDs 
matches well from 4-10 fim, although Mrk 509 shows stronger emission features, presumably due to silicates. 

The similarity of the infrared SED of QSO 2237-1-0305 to other Seyferts and quasars indicates that our microlensing 
studies of this object will broadly apply to radio-quiet active galaxies. 
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Fig . 4. — Comparison of the QSO 2237+0305 SED with the composite infrared spectrum for far-IR weak quasars from iNetzer et ahl 
II2007I) (hght green solid curve). Red filled circles are IRAC data; black connected points with error bars are binned IRS data. 
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Fig. 5. — Black connected points with error bars: IRS spectrum of QSO 2237+0305. Green solid lines: spectra of Seyfert galaxies Mrk509 
(top) and MCG- 2-58-22 (bottom) scaled to match flux of QSO 2237+0305. 
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3.2. Dust emission model 

To improve our physical understanding of the emission from QSO 2237+0305 we fitted the SEP with the models o f 
iFritz et all ()2006f ). The models utilize the Mathis-Rumpl-Nordsicck (MRN) dust size distribution (jMathis et al.lll977f) 
with scattering and absorption opacities from Xaor and Draine (1993.) ■ The model fixes the geometry as a torus with 
an opening angle that is independent of radius and an inner radius of the torus, 

Rmin — l-3pcZ/4g "^ISOO ' i^) 

which is set by a dust sublimation temperature of 1500 K, where L46 is the AGN luminosity in units of 10^^ erg 
s~^. The dust density in the model is described by p{r,9) oc r'^e"'''' ^^^1 , with a dust-free cone within polar-angle 

< Ocone- The grid of models covers a range of parameters for the dust with: (1) the ratio of the outer to inner 
radius, 30 < Rmax/Rmin < 300; (2) the variation of dust density with radius, — 1 < /3 < 1/2; (3) the equatorial optical 
depth at 9.7 /im, 0.1 < rg 7 < 10; (4) inclination angles, i, ranging from 0.01 (edge-on) to 89 degrees (face-on), and 
11,21,31,41,51,61,71,81° in between; (5) dust-free cone with size 20° < Ocone < 60°; and (6) an angular cutoff in 
density with < 7 < 6. 

We have scaled the infrared portion of the models by an arbitrary constant to optimize the match with our Spitzer 
SED. The best-fit model is shown in Figure[6l which has parameters Rmax/Rmin = 100, f3 = —1/2, T9.7 = 10, i = 71°, 
dcone = 20°, and 7 = 3. The best fit implies an AGN luminosity of 4.4 x lO''^ erg s~^, remarkably close to the 
luminosity found from the QSO 22374-0305 SED (L = 4.0 x 10^'^ erg/s), despite the fact that we have only fit the 
Fritz models to the infrared data. This may imply that the dust acts as a fairly good calorimeter of the total AGN 
flux. The qualitative fit to the data is fair: the model shows a peak near 2.5 /Ltm in the rest frame, and gradual decline 
towards longer wavelengths, and an emission feature near 10/xm. However, quantitatively the fit is horrible: = 3045 
for 94 degrees of freedom. This is primarily due to the fact that the short-wavelength peak is more prominent in the 
model than in the observations and the silicate absorption and emission features in the model are offset in wavelength 
of the observed features. It is possible that optimizing the parameters will improve the fit as the grid is quite coarse 
and some of the best-fit parameters are at the extreme values of the grid, such as Ocone- Also, the viewing angle is 
71° which is only 1° within the opening angle of the cone; however, viewing angles of 81° and 89° are very similar in 
shape, but only slightly poorer fits, plotted as dotted lines in Figure [6l Consequently we do not believe that the fitted 
parameters are unique or even correct; indeed the simple geometry chosen by Fritz et al. may be wrong. The main 
point is that a dusty torus model can produce a fair qualitative fit to the SED of QSO 2237-1-0305; further development 
of theoretical models will be required to obtain a better quantitative fit. 

3.3. Measured flux ratios 

Figure [7] shows the ratios of the quasar image fluxes as a function of wavelength from Table [2] The F-band data are 
the data obtained from the OGLE data archive (jUdalski et al.|[2006[) taken a t a tim e closest to our IRAC observations: 
2005 Nov. 16 00:48 UT (HJD). The 10 micron points are from lAgol et al.l (|2000[ ). and thus are not simultaneous to 
our Spitzer obse rvations; however th ere appears to be little variability at this wavelength. Also plotted are the model 
flux ratios from iTrott and Websted ^2002) which is the most complete model of the lens galaxy of QSO 2237+0305 
constructed to date, including the bar and spiral arms. The agreement with the 10 micron flux ratios is expected since 
these were used as a constraint on the model; however, the model gives very similar flux ratios as an earlier model by 

1 Schmidt et al.l ()1998l ) which was constructed before the 10 micron observations. 

The general trend is obvious in Figure [T] for all three pairs of images there is a strong microlensing anomaly in 
the optical which gradually disappears towards longer wavelengths. This is precisely the behavior expected from the 
standard model of quasars: the source should be larger at longer wavelengths and thus less affected by microlensing. 

3.4. Two-component model of flux ratios 

Ideally we would like to use the microlensing flux ratios to constrain the size of the quasar as a function of wavelength: 
howe ver, only weak constraints can be derived without a detection of time variability due to microlensing ( Wyithe et al] 
|2002[) . Instead we use a semi-empirical model for the infrared SED to predict what the flux ratios should be as a function 
of wavelength and compare these predictions with the observed flux ratios to conflrm the plausibility of this model. 

We constructed a semi-empirical model for the flux ratios to compare to the observed IRAC flux ratios as follows. 
We assume that the spectrum consists of a power-law component due to an accretion disk (we are modeling the region 
from 0.4 — 4.0/im which is well longward of the peak of the disk spectrum and is only a decade in frequency, so a 
power-law should be an adequate approximation of a disk spectrum) and a single-temperature thermal dust emission 
component, representing the inner edge of the a dusty torus. We fit the spectral energy distribution from 0.4 to 4.0 
microns in the rest frame with these two components, determining their relative strength at each wavelength. The 
best fit is shown in Figure [U the model provides a good fit to the four IRAC data points. We have not attempted to 
correct for microlensing, nor possible time-variability as the SED data are not simultaneous. However, this will likely 
have a small effect on the SED as summing over all four images reduces the impact of microlensing and in the infrared 
quasars are weakly variable. 

With these two fits we determined the minimum possible source sizes to reproduce the observed flux with thermal 
emission as follows. For the power-law component, we assumed a disk geometry with a temperature that is a power-law 
in radius, r, finding T oc r^"'^^, and found that the half-light radius should scale with wavelength as 

ri/2 = 4 X 10^^cmAl■^ (2) 
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Fig. 6.— Best-fit model from lFritz eFaP (f2003) plotted versus the Spitzer rest-frame spectrum of QSO 2237+0305 as described in the 
text. Data is solid black line connecting points with error bars; best-fit model (71°) is solid red line, while dotted green lines (which appear 
as one line) are the same model viewed at 81° and 89°. 

where A is measured in microns in the rest frame of the quasar. At this radius the standard disk model is weU 
outside the inner edge and thus is expected to have a temperature dependence of T oc r"^/"*, which is close to the 
measured dependence. We assumed that the d ust co mponent either has an emissivity described by optically-thin 
interstellar medium (ISM) dust with the model of lDrai nc (2003) or emits as an optically-thick blackbody (BB). These 
two extremes were chosen to bracket the range of possible behaviors for the hottest dust at the inner edge of the torus 
(we did not use the best-fit Fritz model due to the different peak wavelength). We found best-fit temperatures of the 
dust of 1168 K (ISM) or 1264 K (BB). This requires a minimum distance from the quasar of 3.83 pc (ISM) or 0.76 pc 
(BB) to maintain temperature equilibrium, assuming that the quasar emits isotropically. The total luminosity in this 
component is 6 x 10**^ erg/s (ISM/BB) implying an emission region of at least 1.3 pc (ISM) or 2.2 pc (BB) in size, 
consistent with the radiation equilibrium argument within a factor of 3. 

If the average mass of microlenses is 0.1 Mq, then the Einstein radius is te = 5.77 x IQ-'^^ cm projected to the 
source plane. At one micron the power-law emission (if thermal) is comparable in size to an Einstein radius, while 
the dust emission component is about 200 times larger than the Einstein radius. Thus, the power-law component will 
be affected by niicrolensing, while the d ust component should be nearly unaffected by microlensing, with variations of 
less than 1% ()Refsdal and Stabel]|[l99ll ). 

To model the microlensing of th e power-law component, we extrapolate the wavele ngth dependence of the optical 
flux ratios measured with the VLT (|Eigenbrod. Courbin. Sluse. Meylan and Agoi2008l ) to the infrared. We correct the 
optical flux ratios for differential extinction using the E(B-V) values derived from broad emission lines, as described 
in section Then, the expected flux ratio as a function of wavelength, rsAW, of images A and B is given by: 



(3) 



where /ip'^(A) are the magnifications of the power-law (P) and dust (D) components as a function of wavelength, 
Fp.oW E^re the intrinsic (un- magnified) fluxes of these two components, and ta,_b(A) are the optical depths through 
the lens galaxy for each component (we have dropped A in this equation for simplicity). 
Now, as argued above, the dust component should be large enough to be unaffected by microlensing, so rsA.D — 
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Fig. 7. — Flux ratios of QSO 2237+0305 as a function of wavelength. Red points show flux ratios of image B to image A, blue are ratio 
of image C to image A, and green are ratios of image D to image A. The V-band data (left-most points) are taken from the OGLE website 
at the time closest to the I RAQ observations; the 11 micron data are taken from lAeol et al.i (l2000i ). The dotted lines are the flux ratios 
predicted from the model of lTrott and Websteil ||2002I '1. 

A^ n/j^n can be derived from a model for microlensing. We utilize the model of lSchmidt et al.l (I1998D. im proved upon 
bv lTrott and Websteil ()2002f ) , with more recent modifications based on kinematic data |Trott et al mm . The relative 



strengths of the power-law and dust components we take from the model of the spectral energy distribution (Figure 
13), fop — FdW/ Fp{X); as mentioned above; as the SED is constructed from the sum of all four images it should 
be affected only weakly by microlensing. Finally, the wavelength dependence of the microlensing magnification we 
take from the extinction-corrected optical fiux ratios measured with the VLT, tba.p = f^p/lJ'p- For the extinction 
correction we assume a Milky Way extinction curve with R = 3.1. Dividing through the numerator and denominator 
by ^pFp., we can rewrite the above equation as 



rpA.P + {f^o/f^pyBA.DfDP - 

rsA = — ■ — — — —- — '■ e 



■rB+TA 



1 + (m^/m^)/. 



DP 



(4) 



The remaining unknown in this equation is /z^ / Hp{\) since the microlensing magnification of the power law component 
for each image is unconstrained by our data. Fortunately the left hand side of this equation is weakly dependent on 
this ratio. We use the size distribution derived from the power-law emission spectrum (equation [2]) to compute the 
probability distribution as a function of wavelength from microlensing simulations of each image using the macrolensing 
parameters from the model of Schmidt ct al. ( 1998 ). We utilized the code of W ambsganss, (1999 ) to run the simulations, 
creating 10 simulations for each image of a size 20rE x 2QrE, and then convolving the magnification pattern with a 
Gaussian source with the same half-light radius as derived in equation [2l 

To predict the flux ratios at IRAC wavelengths, we have run 10^ Monte Carlo simulations sampling the relative 
extinction, the optical fiux ratios, the galaxy macro-lensing model fiux ratios, and the relative microlensing magnifi- 
cation for the two images within the uncertainties of each parameter. We have then computed the median and 68% 
confidence limits at each wavelength from these simulations, which is plotted in Figure [9l for the ratio of images B 
to A, as well as several other image pairs. We find good agreement between the model predictions and the observed 
fiux ratios. For all but 3 fiux ratios the data are within the 68% confidence limits of the model fiux ratios. Thus the 
wavelength dependence of the fiux ratios is consistent with the accretion disk/dusty torus model. 
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Fig. 8. — Rest-frame spectral energy distribution of QSO 2237+0305 fit from 0.4-4 fim with a power-law (red solid line) and thermal 
dust emission (green solid line) component. The black solid line is the best-fit sum of both components. 
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Fig. 9. — Flux ratios of images B/A, C/B, D/A and D/B versus observed wavelength (crosses) 
(dashed) prediction of microlcnsing model of accretion disk and dusty torus. 
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4. DISCUSSION 

The primary two results in this paper are: (1) a measurement of the infrared spectral energy distribution of the 
Einstein Cross; (2) a derivation of the infrared flux ratios in the mid-infrared (3.6, 4.5, 5.8 and 8.0 fim observed; 1.3, 
1.7, 2.2 and 3.0 /im in the rest frame) and comparison to a microlensing model. We discuss the implications of these 
results in this section. 

4.1. Comparison with prior work 

lAgol et aD ()2000l ) observed of QSO 2237+0305 with the Long Wavelength Spectrometer on the Keck I telescope at 
8.9 and 11.7 fim; the flux ratios at these two wavelengths were identical within the errors, and both agreed with the 
macrolcnsing flux ratios as predicted by the best lens models. The flux at 11.7 /um reported in that paper agrees well 
with our IRS spectrum; however, the 8.9/im Keck data point is higher than the IRS data by about 40% indicating that 
the Keck data had an incorrect calibration. We have not tracked down the source of this discrepancy, but it cannot 
be due to microlensing which would have caused a difference in the flux ratios at 8.9 and 11.7/im. We have more 
confidence in the calibration of the Spitzer spectrum; consequently the Keck flux at 8.9 /xm was likely in error. The 
qualitative agreement between the Spitzer SED and the Netzer composite indicates that QSO 2237+0305 behaves as 
a typical quasar in the near infrared. 

4.2. QSO 2237+0305 SED 

As menti oned above, the Q SO spectral energy distribution. Figure [6l is qualitatively well fit with an AGN torus 
model from lFritz et al.l ()2006f ): however, the publicly available models have a fixed temperature for the inner edge of 
the torus at 1500 K which is somewhat higher than the temperature of the inner edge we have estimated (1164-1250 
K). In addition, the silicate spectral feat ures of the model are a poor fit to the SED, as is commonly seen in comparing 
dusty torus models to AGN SEDs (e.g. iNenkova et al.l[200^ . We have found that some Type II models (obscured 
quasar) fit the silicate feature well, having an offset silicate feature due to radiation transfer effects, while these models 
do not fit shorter wavelengths which are obscured. Thus, it may be possible that the dust composition and/or torus 
opening angle changes with radius causing the silicate feature to appear more like that of Type II quasars, while 
allowing the inner edge and quasar to be visible so that shorter wavelengths look more like a Type I quasar. Another 
possibility is that the silicate properties are modified near quasars, either due to c hanges in the grain size distributio n 
or due to grain porosity, causing the silicate spectral features to be shifted (e.g. IVoshchinnikov and Henning||2008^ . 
Both possibilities need to be explo red in f uture models of dust grain opacity, as well as computing better physical 
models for the dusty torus, such as iKrolid ()2007f ). 

4.3. Microlensing, flux ratios, and spectral components 

The agreement between the measured and predicted flux ratios is quite good, close to 1 — tr for all data points 
except one (Figure [5]). The uncertainties in the flux ratio predictions are highly correlated between all wavelengths 
since microlensing and extinction have a monotonic variation with wavelength, so the case of the ratio of images C 
to B (for example) is still highly probable. It is clear from Figure [9] that an extrapolation of the optical power law 
(which can be seen shortward of 1 micron) does not do a good job of predicting the infrared flux ratios, while including 
the un-microlensed infrared bump due to dust emission brings the flux ratios back into agreement with the data 
(within one standard deviation). If the power-law component had a cutoff around one micron so that the infrared data 
were solely due to the extended dust emission, then the flux ratios would change abruptly to the macrolensed values. 
This is not the case for, e.g., the ratio of image D to image A, indicating that both the power-law and dust emission 
components are required to fit the IRAC data. This adds evidence to the case for the presence of an accretion disk 
emission component under the infrared bump. 

By extrapolating the wavelength dependence of the microlensing flux ratios from the optical, we mostly avoided 
needing the size of the accretion disk in units of the Einstein radius. The only place the size of the accretion disk 
enters our analysis is in computing the flux ratios of the disk and torus components for each image in equation [4] 
{li"^/lip, and the same ratio for images B — D). However, this factor cancels out when either the disk or torus 
components dominate the flux, so our results are weakly sensitive to our choice of source size (equation [J) and Einstein 
radius (M = Q.IMq). 

5. CONCLUSIONS 

It has long been hypothesized ([Sanders et al.|[l989r ) that the near-universal dip near 1/im in quasar spectral energy 
distributions is due to the cutoff in emission of the dusty torus at short wavelengths due to dust sublimation close 
to the quasar. Even if both the disk and dust emit as blackbodies, the disk emits at a higher temperature and has 
a smaller flux than the dust and thus is much more compact in size, by a factor of about ~ 100 at 1 /im in the rest 
frame. The compact infrared disk emission should be more strongly affected by microlensing than the extended dusty 
torus infrared emission. This trend is conflrmed by the wavelength dependence of the flux ratios in the IRAC bands for 
QSO 2237+0305 (Figure [3]), and is in good quantitative agreement with our prediction of the wavelength dependence 
of the flux ratios assuming a two-component model, accretion disk plus dusty torus. Since the infrared SED of QSO 
2237+0305 looks very similar to a standard quasar and similar to some low redshift Seyferts (Figures |4] and [5|) , this 
result conflrms the model that quasars contain an accretion disk an d dusty torus. Indeed, a similar behavior of the flux 
ratios was found for the two-image lensed quasar HE 1104-1805 bv lPoindexter et al.l (|2007f) with optical, near-infrared 
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and Spitzer observations: in the infrared the microlensing anomahes disappear. iPoindexter et all (|2008D modeled the 
source size for this quasar as a power-law with wavelength rather than with a two-component model; due to the lack 
of variability at the IRAC wavelengths their derived size of ~ 3 x 10^^ cm is actually a lo wer limit on the s ource size, 
and thus is consistent with a dusty torus model. Unified models for active galaxies (e.g. lAntonuccilll993D require a 
dusty torus for obscuration of low-polarization Type I AGN to create higher polarization Type II AGN, while our 
result prov ides additional ev i dence for the unified model in an unobscured quasar. 

Recently iKishimoto et al.l (|2008D have demonstrated the co-existence of the accretion disk and dusty torus compo- 
nents near one micron using infrared spectropolarimetry. In total flux the accretion disk component is masked by the 
stronger unpolarizcd thermal dust emission at wavelengths longer than ^ 1/im. Since the accretion disk is emitted 
from a small scale, it can be polarized by scattering off of gas within the dus ty torus, while the thermal emission 
from the dusty torus is unpolarized since it is exterior to the scattering region. .Kishimoto et al.l (|2008il have detected 
a polarized component with a power-law shape which extends into the infrared, which they identify with polarized 
emission from the accretion disk, thus demonstrating the contribution from both the disk and torus near the one 
micron dip. Our results provide a complementary confirmation of the results of IKishimoto et aTl ((2008). 

There are two primary areas which require improvement over the current work: (1) time-dependent monitoring at a 
broad range of wavelengths to derive the relative size versus wavelength from the microlensing, rather than deriving a 
size versus wavelength from the SED and then predicting the microlensing behavior as we have done; (2) computing 
physically complete dusty torus SEDs coupled to accretion disk models. 
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